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USEFUL BOUNDS ON THE EXTREME EIGENVALUES AND 
VECTORS OF MATRICES FOR HARPER’S OPERATORS 

DANIEL BUMP, PERSI DIACONIS, ANGELA HICKS, LAURENT MICLO, 

AND HAROLD WIDOM 


Abstract. In analyzing a simple random walk on the Heisenberg group we en¬ 
counter the problem of bounding the extreme eigenvalues of an n x n matrix of 
the form M = C -\- D where C is a circulant and D a diagonal matrix. The dis¬ 
crete Schrodinger operators are an interesting special case. The Weyl and Horn 
bounds are not useful here. This paper develops three different approaches to 
getting good bounds. The first uses the geometry of the eigenspaces of C and 
D, applying a discrete version of the uncertainty principle. The second shows 
that, in a useful limit, the matrix M tends to the harmonic oscillator on L^(E) 
and the known eigenstructure can be transferred back. The third approach is 
purely probabilistic, extending M to an absorbing Markov chain and using 
hitting time arguments to bound the Dirichlet eigenvalues. The approaches 
allow generalization to other walks on other groups. 


November 10, 2015 

1. Introduction 


Consider the n x n matrix 


( 1 ) 



As explained in and summarized in Section]^ this matrix arises as the Fourier 
transform of a simple random walk on the Heisenberg group, as a discrete approx¬ 
imation to Harper’s operator in solid state physics and in understanding the Fast 
Fourier Transform. Write M = C + D with C a circulant, (having | on the diago¬ 
nals just above and below the main diagonal and in the corners) and D a diagonal 
matrix (with diagonal entries ^ cos ) for 0 < j < n — 1). The Weyl bounds [20] 
and Horn’s extensions |2] yield that the largest eigenvalue Xi{M) < Ai(C') -I-Ai(Zl). 
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Here Ai(C') = Xi{D) = ^ giving Ai(M) < 1. This was not useful in our applica¬ 
tion; in particular, we need Ai(M) < 1 — This paper presents three different 

approaches to proving such bounds. The first approach uses the geometry of the 
eigenvectors and a discrete version of the Heisenberg uncertainty principle. It works 
for general Hermitian circulants: 


Theorem 1. Let C be an n x n Hermitian circulant with eigenvalues Ai(C') > 
■ ■ ■ ^ A„(C'). Let D be an n X n real diagonal matrix with eigenvalues \i{D) > 
• • • > ^n{D)- If k, k' satisfy 1 < k,k' < n, kk' < n, then 

Xi{C + D) < Xi{C) + Xi{D) 

- imin{Aip)-Afep),Ai(C)-Afe.(C)} U 



Example 1. For the matrix M„ in 0. the eigenvalues of C and D are real and 
equal to { 5 cos (^) For simplicity, take n odd. Then, writing Xj = 

Xj{C) = Xj{D), Xi = I, X 2 = X 3 = I cos (^), and A 2 j+i = X 2 j = 5 cos (^) for 
1 < j < Choose k = k' = for a hxed 0 < c < 1. Then 


Afe = 


1 

2 






and the bound in Theorem [T] becomes 


Ai(M„) < 1- 


^2 „2 


{l-cf 



The choice c = ^ gives the best result. Very sharp inequalities for the largest and 
smallest eigenvalues of follow from [3] . They get better constants than we have 
in this example. Their techniques make sustained careful use of the exact form of 
the matrix entries while the techniques in Theorem work for general circulants. 


The second approach passes to the large n limit, showing that the largest eigen¬ 
values of M„ from Q tend to suitably scaled eigenvalues of the harmonic oscillator 


Theorem 2. For a fixed k >1, the kth largest eigenvalue of equals 

I 

with smallest eigenvalue of L. 


1-ho 


Theorem]^ gets higher eigenvalues with sharp constants for a restricted family of 
matrices. The argument also gives a useful approximation to the fcth eigenvector. 
Similar results (with very different proofs) are in [5S] . 

There are many techniques available for bounding the eigenvalues of stochastic 
matrices ( |24j . m, and Cl)- We initially thought that some of these would adapt to 
M„. However, is far from stochastic: the row sums of are not constant and 
the entries are sometimes negative. Our third approach is to let = ^L + ^Mn- 
This is substochastic (having non-negative entries and row sums at most 1). If 
Qi = 1 — J2j Xl'nifj)^ consider the (n -h I) x (n -h 1) stochastic matrix: 
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(2) 


M” = 


1 

ai 

02 


0 0 


M' 

n 


This has the interpretation of an absorbing Markov chain (0 is the absorbing 
state) and the Dirichlet eigenvalues of M" (namely those whose eigenvalues vanish 
at 0) are the eigenvalues of M!^. In path and other geometric techniques are used 
to bound these Dirichlet eigenvalues. This results in bounds of the form 1— for 
Xi{Mn). While sufficient for the application, it is natural to want an improvement 
that gets the right order. Our third approach introduces a purely probabilistic 
technique which works to give bounds of the right order for a variety of similar 
matrices. 


Theorem 3. There is a c> Q such that, for all n > 1 and Mn defined at the 
largest eigenvalue satisfies Ai(M„) < 1 — 

Section gives background and motivation. Theorems 0i and are proved 
in Sections iS and Section treats a simple random walk on the affine group 
mod p. It uses the analytic bounds to show that order steps are necessary and 
sufficient for convergence. It may be consulted now for further motivation. The 
final section gives the limiting distribution of the bulk of the spectrum of M„(a) 
using the Kac-Murdock-Szego theorem. 


2. Background 

Our work in this area starts with the finite Heisenberg group: 

^fi(n) = I ^0 1 : x,y,zGZ/nZ^. 

Write such a matrix as {x, y, z), so 

{x, y, z){x\ y', z') = (a; + a;', y + y', z ^ z' + xy'). 


Let 

(3) 


5 = {(1,0,0), (-1,0, 0), (0,1, 0), (0,-1, 0)} and 


(4) 


Q{9) 


g e S 
otherwise 


Thus S' is a minimal symmetric generating set for Hi{n) and Q is the probability 
measure associated with ‘pick an element in S at random and multiply.’ Repeated 
steps of this walk correspond to convolution. For {x,y,z) G Hi{n), 


Q* {x,y,z) 


E 


Qix',y',z')Q* 


{{x,y,z){x',y',z') ^). 


{x' ,y' 
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When k is large, Q*^ converges to the uniform distribution U{x,y,z) = The 

fc 

rate of convergence of Q* to U can be measured by the chi-squared distance: 


(5) 




{x,y,z)GH 


pGffi 


On the right, the sum is over nontrivial irreducible representations of Hi(n) with p 
of dimension dp and Q{pY = y z) Q* Y- For background on the 

Fourier analysis approach to bounded convergence see |H], Chapter 3. 

For simplicity, (see for the general case) take n = p a prime. Then i?i(p) 
has 1-dimensional representations Pa,b{x, y, z) = for a, b inZp. It has 

p — 1 p-dimensional representations. These act on V = {f : Zp ^ C} via 

Paix, y, z)f{w) = + w), 0<a<p-l. 

The Fourier transform of Q at pa is the matrix Mn{a) as in 

replaced by cos for 0 < j < p — 1 . 

The chi-squared norm in ([^ is the sum of the (2fc)th power of the eigenvalues 
so proceeding needs bounds on these. The details are carried out in [5]. The main 
results show that k of order rY steps are necessary and sufficient for convergence. 
That paper also summarizes other appearances of the matrices M„(a). They occur 
in discrete approximations of the ‘almost Mathieu’ operator in solid state physics. 
In particular, see ED, 0, and 0. If J^n is the discrete Fourier transform matrix 
d-n)jk = it is easy to see that Diagonalizing 

J^n has engineering applications and having a ‘nice’ commuting matrix should help. 
For this reason, there is engineering interest in the eigenvalues and vectors of M„(l). 
See [H] and [2H] . 


) with cos 


(w) 


3. Proof of Theorem [T] 

Throughout this section C is an n x n Hermitian circulant with eigenvalues 
> A 2 (C') > ■• • > Xn{C) and D is a real diagonal matrix with eigenvalues 
Ai(D) > \2{D) > • • • > Xn{D). Let x be an eigenvector of C -|- D corresponding 
to Xi{C + D). Recall that {{Fn)jk = ^ ZjnZ. This has rows 

or columns which simultaneously diagonalize all circulants. Write x = T^x and x^ 
for the conjugate transpose. We use ||x|p = x^x. 

Our aim is to prove that for kk' < n, 

(6) Ai(C + D) < Ai(C) + Ai(D) 

1 ( I kk' 

- - min{Ai(D) - Afc+i(D), Ai(C) - Afc.+i(C)} I 1 " ^ — 

The first step is to write x^Cx in terms of a Fourier transform pair x = TnX. 
A subtle point is that although Tn diagonalizes (7, the resulting diagonal matrix 
does not necessarily have entries in decreasing order, necessitating a permutation 
indexing in the following lemma. 
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Lemma 1. Define a permutation a such that 

2TTi —11 

e^D b 

- 7 =—, 0 < 6 < n — 1 

i/n 

is the eigenvector corresponding to \j{C). Then 
(7) x^Cx = x^D'x 

with D' = diagiXfji (C ),..., Acr„ (C')). 

Proof. Since C is diagonalized by = D'. Thus 

X^CX = X^F'fFnCFllFnX = x^D'x. 


□ 


A key tool is the Donoho-Stark m version of the Heisenberg Uncertainty Prin¬ 
ciple. For this, call a vector y ‘e-concentrated on a set S' C [n\ if \xi\ < e for 
^ ^ S. 

Theorem 4 (Donoho-Stark). Let y,ybea unit norm Fourier Transform pair with 
y es-concentrated on S and y ct- concentrated on T. Then 

( 8 ) \s\\T\>n{l-{es + eT)r- 

Let (y)s be the projection onto the subspace vanishing off S: 

{(.y)s)^ otherwise ’ 

A simple consequence of the bound ^ is 

Corollary 1. If kk' < n, z, z a, unit norm Fourier transform pair and S and T are 
sets of size k, k', then 

llWscf + ||(z)Tef . 


Proof. Let es = IK^js^H and ct = ||(2 )t'’||- Then ||z - (z)s|| = ||(z)s<’|| and 
\\z — (z)t|| = ||(-2)r=||- Thus z is es concentrated on S and z is ex concentrated on 
T. Thus 0 gives 


I'S'IITI kk' ^ , 

-= - > (1 — (es + e*))^ or (es + ct ) > 1 

n n 



so if kk' < n 




{z)l 


P — (el + el) ^ ~ 2 ( ^ 



2 


□ 


Proof of Theorem [7} With notation as above, 

Ai(C + D)= x''{C + D)x = -k x^Dx = x^D'x + x^Dx =: *. 
Let D = D - Ai(D)/ and W = D' - Ai(C')/. Then 

* = x^Xi{C)Ix -k x^D'x -k x^Xi{D)Ix + x^Dx 
= Ai(C') -k x^D'x -k Ai(D) -k x^Dx 
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Now D and D' have non-positive eigenvalues so our improvement over the Weyl 
bounds will follow by showing that x or x have support on suitably negative entries 

of d' otD. _ 

Let S and T correspond to the largest fc, k' entries oi D, D , respectively. Then 
X = (x)s + (x)s'=, X = {x)t + {x)t'=- Each of those decomposition is into orthogonal 
pieces. Multiplying any of the four pieces by an arbitrary diagonal matrix preserves 
this orthogonality. Thus 

* = Ai(C) + Alp) + {x)'^'U{x)t + {x)!^oW{x)t^ + {xfsD{x)s + {x)'^s‘=Dix)s^. 

For the last four terms on the right, terms 1 and 3 are bounded above by zero and 
2 and 4 contribute with the following bounds: 

* < Ai(C) + Alp) + (Afe+ip) - Aip))||(x)scf + (Afe.+i(C) - Ai(C)||Prcf 
< Ai(C) + Aip) 


where the last line follows from the corollary. □ 



+ min{(Afc+ip) - Alp)), (Afc.+i(C) - Ai(C)} 


Remarks. (1) These arguments work to give the smallest eigenvalue as well, 
so in fact we also have for W < n: 


(9) A„(C + p > A„(C) + A„p) 


-f - min{A„_/(i:)) - \n{D), X^-v{C) - A„(C)} ( 1 



(2) Our thanks to a thoughtful anonymous reviewer, who pointed out that 
Corollary can be improved using Cauchy- Schwartz to show that for 0 < 


a,b < 1, 



^ (cs + £t)^ ^ "b 



Setting a = Xk+i{D) — Xi{D) and b = Xk+i{D) — Ai(D), one can improve 
the previous theorem: 


Xi{C + D) < Alp) + Alp) 

(Alp) - Afc+ip))(Aip) - Afc>+ip)) ( 

{Xi{D) - Xk+i{D)) + (Ai(C) - Afc-+i(C)) 

In our case, the result is the same, since the eigenvalues of C and D are 
identical. 

(3) Donoho and Stark |15j give many variations on their uncertainty principle 
suitable for other transforms. The techniques above should generalize, at 
least to the G-circulants of [9]. 

(4) There should be similar theorems with C and D replaced by general Hermit- 
ian matrices and perhaps extensions to higher Weyl and Horn inequalities 
(see [5] and [TB]!. 

(5) Further applications/examples are in Section]^ 
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4. The Harmonic oscillator as a limit. 


We prove Theorem 2, that for fc > 1 the fcth largest eigenvalue of M„ is equal to 
1 — /r/c/n + o(l/n) and the fcth smallest eigenvalue of is equal to — 1 + ^fc/n + 
o(l/n), where is the fcth smallest eigenvalue of 

r 1 2 2 

on (—00,00). By a classical computation (see [19]), /Xfc = (2fc-1)/(2T). 

The n X n matrix has j, fc-entry 

i [S{j - k - 1) + 6{j - k + 1)] + ^ cos{2TTk/n)5{j - k), 


where j, k £ Zn = TLjnL. 

We define 

Mn = n{I - Mn). 

This has j, k entry mi{j, k) + m 2 {j, k), where 


( 10 ) 


miij, k) = - [ S{j -k)- - [S{j - k - 1) + S{j - k + 1)] 


Tl 

(11) m 2 {j,k) = - (1 - cos(27rfc/n))(5(j - fc). 

We will show first that if fjL is any limit of eigenvalues of Mn then /i is an 
eigenvalue of L; and, second, that any eigenvalue /i of L has a neighborhood that 
contains exactly one eigenvalue, counting multiplicity, of Mn for n sufficiently large. 
These imply the stated result. 

These will be accomplished as follows. Give each point of Z„ measure l/v^j 
so the total measure equals y/n. We then define an isometry T from L^(Z„) to 
L^(—-\/n/2,1/71/2) (thought of as a subspace of L^(M) with Lebesgue measure) for 
which the following hold: 


Proposition 1. Suppose {u„} is a sequence of functions of norm one in L'^ifLn) 
such that the sequence {(M„ ii„, it„)} of inner products is bounded. Then {Tun} 
has a strongly (i.e., in norm) convergent subsequence. 


Proposition 2. If (f is a Schwartz function on K then TMnT*(t> —t L4> siro7i(iZi/0 

These will easily give the desired results. (See Propositions and near the 
end.) The final section 4.2 treats the smallest eigenvalues. 


4.1. Proofs for the largest eigenvalues. We use two transforms (with, confus¬ 
ingly, the same notation). First, for / in L^(—Y^n/2, y/nf2) we define 
^ /■Wt/2 

(){£)= e-'^”^^^'^f){x)dx, (£gZ), 

J —y/n{2 

and we have by Parseval (after making the substitution x 1—>■ x^/n in the integral) 

(12) ll/ll =ni/4||<^||. 

Here ||^f = 


^The operator T* acts on L‘^{—y/nl2^ so (p is first to be restricted to (—■y/n/2, ■y/n/2). 
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For u G L'^{Zn) we have its finite Fourier transform 

= (£eZ„), 


and we compute below that 
(13) 

Here 


IImII = Hull. 


i«f = Ei^wi' 


the sum over any integer interval of length n. To show (131, we have 

„-27ri£(i-fc)/n ^ 
j,k 


| u (£)|2 = 


Since e is an nth root of unity, equal to 1 only when j = /c in Z„, we get 

iim|P = E ^ E =' 


n^/'llulP. 


Now we define the operator T. Let J be an interval of integers of length n (which 
later will be specified further) and set 


Dn{x) = E® 
eej 


27Ti£x 


Then T is defined by 


{Tu){x) = - E 

n ^ \ yjn n J 


Thus T has kernel 


T{x,k)=^D^(^-^ 
n \ y'n n 

By the definition of the inner product on L^(Z„) we find that 

T* : L2(_y/^/2,7^/2) ^ L2(Z„) 

has kernel 

T*{k,x) = ^D^(^- ^ 

y/n \n vn 

In terms of the transforms we have the following: 

Lemma 2. (a) For u G L^(Z„), 

^uie) ifiGj, 


Tuii) = 


(b) For 4> G L'^{-y/nl2, 


if £ ^ J. 


T*(l){i) = y/n cj){i) when i G J. 
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Proof. For (a), we have 


Tu{i) = - [ 


kJ'eJ 


^ fgj 


The result follows. 

For (b), we have when £ £ J, 




= \/n / e 


— 27 riik}n ^ 27 zii' {k jn—xf -y/n) 

kJ'GJ'^ 


□ 


We show two things about T. For the second we shall assume now and hereafter 
that the end-points of J are ±n/2 + 0(1), although this is a lot stronger than 
necessary. 


Lemma 3. (a) T* T = I. (b) TT* ^ I strongly as n —)■ oo. 

Proof. By Lemma [2>, 

f^{£) = yfnTu{£'), 

where £' & J and t" — £ S nZ. By Lemma[^ this in turn equals u[£'), which equals 
u[£) since u is n-periodic. This gives (a). 

For (b) observe that TT* is self-adjoint. Since {TT*Y = XT* TT* = TT*, it 
is a (nonzero) projection and so has norm one. Therefore it suffices to show that if 
(p is a Schwartz function then TT*(j) —>■ p. We have from Lemma that 

flY>{£) = f*P{£) 

Vn 


if £ S J and equals zero otherwise. If £ € J then by Lemma it equals </>(£). It 
follows that 

= ^|^(£)p. 

ii^j 

Integrating by parts shows that 

P{£) = 0(v^/£), 


and so, by our assumption on J, the sum on the right side is 0(1). Then by (12) 
we get 

\\TT*P-cP\\=0{n-^/^). 

□ 


Now the work begins. First, an identity. We introduce the notations 
0(0 = 1 - cos(27r^), S'(0 = sin(7rO, 


and observe that 0(0 = 2S'(0^- 
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Lemma 4. For u € L^(Z„), 


{MnU,u) = n \\S{£/n)'i 


■ n \\S{k/n) u(fc)|p. 


Note. Here and below we display “fc” as the variable in the ambient space Z„ and 
as the variable in the space Z„ of the Fourier transform. We abuse notation 
and, for example, the “u{k)” above denotes the function k u{k). 


Proof. We consider first the contribution of (10) to the inner product. If we define 
the operators A± by 

{A±u){k) = u{k ± 1), 
we see that the contribution to the inner product is 


Now 


— (u — [Aj^u + H_w]/ 2 , u). 


A±u{^) = 


so if we use (131 we see that the above is equal to 

{C{£/n) u, u{£)) = \\S{(./n) u||^. 

To complete the proof of the lemma we note that the contribution to the inner 
product of (11) is clearly 

— {C{k/n) u{k), u{k)) = n\\S{k/n) u{k)\\‘^. 

□ 

Lemma 5. Suppose Un satisfy {MnU„^Un) = 0(1). Then (a) ||a;ru„(ai)|| = 0(1), 
and (b) ||(Tm„)'|| = 0(1). 

Proof of (a). We have 

Un(e) - U„ie + 1 ) = 5 ] 

k 

the finite Fourier transform of (1 — Un{k). We have, 

1(1 - Un{k)\ = 2\S{k/n) w„(fc)|. 


Therefore from (13) and 

\\S{k/n) Unik)\\ = 0(n“^/^), 

which follows from Lemma we get 

\\uni£) - Un{t +1)11 = 0(n^/^). 

It follows from Lemmathat Tun{() = Tm„(£ + 1) = 0 if both £, £ + 1 ^ J and 

(14) ^ |(ZX,(£)-7X,(^+l)|2 = 0(n-i/"). 

.^,^+lGJ 

li £ G J but £+ 1 ^ J then Tm„(£ + 1) = 0 and £ is the right end-point of J and 
therefore n/2 + 0(1). From 

(15) 


\\S{£/n)un{£)\\ = 0 ( 71 ^/"^), 
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which also follows from Lemma and that |S'(^/n)| is bounded below for I = 
n/2 + 0(1), we have in particular that «„(£) = 0(n^/^), Therefore Tun{t) = 
0 (n“^/^). 


So the bound in (14) holds when the sum is taken over all £ S Z. Since 


TUn{^) 


Tun{^+l) = / e 


— 27r2X. 


1 — e 


— 2Tzix 


TUn(x) dx, 


it follows from ([T^ that 


\\S{x/^/n) Tun{x)\\ 




In the interval of integration \x\ < \/nl2, so \S{x/^Jn)\ is bounded below by a 
constant times \x\/^Jn. This gives (a). □ 


Proof of (b). We have 

{Tun)'{x) = ^ Y, = 


k,eeJ 


ieJ 


Thus Tu'^{t) = 2'Kiiun{i)ln for £ € J, and it follows from ( |12| ) that 

47r 




ieJ 


Now \S{£/n)\ is bounded below by a constant times \t/n\ for ^ e J, so (15) implies 
that 

i&j 

which gives the result. □ 


Proof of Proposition^ Since T is an isometry each ||ru„|| = 1, and by passing to 
a subsequence we may assume {Tm„} converges weakly to some / S L^(IR). We 
use the fact that strong convergence will follow if we can show that ||/|| > 1. (In 
general, if ||/n|| = 1 and /«—>■/ weakly, then |j/|| > 1 implies that /«—>■/ strongly. 
Here is the argument. We have that 

ll/n-/f = ||/nf+ ||/f-2Re(/„,/). 

By weak convergence, (/«,/) —>■ ||/|p. Therefore 

ll/„-/f ^1-ll/f <0, 

so ll/n - /II ^ 0.) 

The hypothesis of Lemmais satisfied. It follows from Lemma that for each 
£ > 0 there is a bounded interval A such that 


11(1 - XA)TUn\\ < S 

for all n. So \\Xa Tun\\ > 1 — e. It follows from Lemma Ep that {Tun} is equicon- 
tinuous, and this combined with HxAT’Mnll < 1 shows that a subsequence of {Tm„} 
converges uniformly on A (to Xa /), and so ||Xa /|| > 1 — £• Thus, ||/|| >1. □ 

Proof of Proposition^ Consider first the operator corresponding to rui in ( [To| ). 
We call it nA^. (The subscript indicates that it acts on functions of A:.) We show 
first that 


nTAl T*4>^-(j)"/^ 
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in We have 


n{TAlT*4>){x) 

_ 1 
^ 2ni/2 


^ ^ ^Tzi l{{x jy/n—k jn) 



Hy) dy 


The exponent in the integral is a function of k/y/n — y. So taking the second 
difference in k is the same as taking the second difference in y as long as the 
differences in the y-variable are 1 /i/n. With this understanding, the above equals 


1 

2ni/2 


'y ' ^2'7ri (((x/^/n—k/n) 



^2 {{kjn-yl 


<P{y) dy. 


By changing variables in two of the three summands from Ay we can put the Ay 
in front of the 4>{y). There is an error because of the little change of integration 
domains but (for cj) a Schwartz function) this is a rapidly decreasing function of n, 
and so can be ignored. After this what we get is nT T* A^ (j). Taylor’s theorem 
gives 

n{Al(l)){x) = -(j)''{x)/A + 0 |(/'"'(2/)|) , 

V \v-x\<\l^ j 

from which it follows that n{A^(j)){x) —>■ —(/)"(x)/4 strongly. Since TT* —>• I 
strongly we deduce that nT A\ T*(j) —)• —(f)" {2 strongly. 

Lastly, consider the operator corresponding to which is multiplication by 
nC'(fc/n)/2 = n(l — cos(27r/c/n))/2. For convenience we call this operator C'„/2. 

By Lemma we know that T*(j){£) = y/n(j){i) when £ G J. In general, 

Cn u{£) = n ^(1 — cos(27rfc/n)) u{k) = n (A^ u){£), 

k 


where here 

(A^ u){£) = u{£) - [u{£ - 1) + u{£ + l)]/2. 
Applying this to T*(j} gives 


T*(!>{£) =nA^ T* (/){£) = A^ ^{£) 


/: 


e^" 4>{x)dx, 


as long as £ ± 1 are also in J. If £ is in J but one of £ ± 1 is not in J then £ is near 
an end-point of J and the error committed will be rapidly decreasing as n —>■ oo. 
For the right side is times a linear combination of integrals like 

r•^A^/2 

gi'Triay'n ^ 

/ —y'n/2 

and integration by parts many time shows this is rapidly decreasing. For the left 
side, if for example £ -I- 1 ^ J then it is the same as the value at £ -I- 1 — n S J, 
which is rapidly decreasing. So we ignore this little error and use 

^ /•y"/2 

A'^(j){£)= 

J -^12 


e - 'C{x/y/n) cj){x) dx. 


We also have T*x'^(j){£) = ^/nx'^(f){t} for £ S J. Thus (ignoring the error), 

f'y/nj2 


Cn T*^{£) - 271^ T*x^(t){£) = Vn 


^—27Tix£f y/n I 


' —■s/n/2 


e - '" {nC{x/\/n) — 2 'k‘^x‘^) (j){x) dx. 
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From this we get 

\\CnT*(j)— 2 'k'^ T*x'^(j)\\ < ||(n C{xj^/n) — 2 tt'^x'^) </>(a:)|| = 

since nC{x/y/n) — 2'k'^x^ = Oix'^jn) and x'^ (p{x) G L^(K). Then we get from ( |l^ 
||C„ T*(l) - 271^ T*x‘^cl)\\ = O(n-i). 

Equivalently, 


||rc'„r>-27r2rr*a;2<^|| =0{n-^). 
Since TT* -G I strongly, this gives 

\\TCnT*<h-2Tr‘^x‘^M ->■ 0 . 


Since the operator corresponding to (11) is multiplication by Cnl‘2-, this completes 
the proof of Proposition!^ □ 

Now we go back to the eigenvalues of M„ and easy consequences of Propositions 

[Hand [21 


Proposition 3. (a) If Xn are eigenvalues of Mn and A„ —> /i, then is an eigen¬ 
value of L. (b) Any eigenvalue fj, of L has a neighborhood that contains at most 
one eigenvalue (counting multiplicity) of Mn for sufficiently large n. 


Proof of (a). Suppose that Un is an eigenvector of Mn of norm one with eigenvalue 
Xn- In particular {MnUn,Un) = Xn- By Proposition there is a subsequence of 
{Tun} that converges strongly to some / G For a Schwartz function ^ we 

have 

(/i/, (jf) = lim {Xn Tun, (j>) = lim (jf) 

= lim {un, MnT*(j)) = lim {Tun, 

since T is an isometry. By Proposition [^ TMnT*(f) converges strongly to Lcf- 
Therefor^ the limit equals (/, L^), and we have shown 

p{f,^) = {f,Lc))- 

It follows that / is an eigenfunction of L with corresponding eigenvalue /r. Here 
is why. The eigenfunctions of L are the harmonic oscillator wave functions (pi, and 
therefore Schwartz functions. If the corresponding eigenvalues are p-i, then 

(m - Mi) if, <t>i) = M if, - if, Lpi) = 0 . 

Since the <pi are complete and / 7 ^ 0, some p, — p-k = 0- And /, being orthogonal 
to the (pi with i k, must be a multiple of pk and therefore a corresponding 
eigenfunction. □ 


Proof of (b)- Suppose the contrary were true. Then there would be sequences of 
eigenvalues {A„} and {A^} of M„, both converging to yi, and corresponding orthog¬ 
onal (since Mn is self-adjoint) eigenfunctions and Un- The strong (sub)limits / 
and /' of Tun and Tu'n would be mutually orthogonal eigenfunctions of L corre¬ 
sponding to the same eigenvalue p, of L. Since the eigenvalues of L are simple, this 
cannot happen. □ 


^For this we need only weak convergence of one and strong convergence of the other. But we 
also need that / 7 ^ 0, which is no easier to show than strong convergence of Tu„, and we shall 
need strong convergence of TMnT*(f> for Proposition 4 
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Proposition 4. For each eigenvalue fj, of L there is a sequence of eigenvalues A„ 
of Mn that converges to /i. 

Proof. With corresponding eigenfunction (f oi L we have, by Proposition 2, 
\\TMr.T*(j>- = \\TM^T*<f-L4>\\=o{l). 

From this, and that \\(j) — TT* (fW = o(l) by Lemma we get 

= o(i). 

Since T is an isometry this is the same as 

\\M^T*<f- giT* f>\\ = o{l). 

Since ||T*(/)|| = |jTT*(/)|| —>• ||(/)|| 0, and the other eigenvalues of L are bounded 

away from /i, this implies that /r is within o(l) of an eigenvalue of and T* (j) 
within o(l) of an eigenvector. See [TT]. □ 

Combining Propositions and [^ shows that the fcth largest eigenvalue of M„ 
equals 1 — fik/n + o(l/n). 


4.2. The bottom eigenvalues of M„. We shall find a unitary operator U on 
Lf{'Ln) such that the quadratic form for n{I+UnMnU*) is the same as for n(/—M„) 
when n is even and close to it when n is odd. From that it will follow that the kih. 
bottom eigenvalue of equals — 1 + + o(l/n). 

Recall that Lemma says that 

{n{I — Mn) u, u) = \\S{i/n) u|p + n \\S{k/n) u{k)\\'^ =: Q{u), 

where S{f) = sin( 7 r^). For this we used the identity 1 — cos^ = 2sin^(^/2). For 
n{I + Mn) this gets replaced by 1 + cos^ = 2cos^(^/2). So now we define 

C'(^) = cos( 7 r^), 

and get 

{n{I + Mn) u, u) = ||C'(£/n) u||^ + n \\C{k/n) u{k)\\'^. 


We consider first the less straightforward case of n odd and define 
v{k) = (n+ l)/ 2 ), 

where a (real) will be determined below. We have 

\\C{k/n) u{k)\\ = ||C'(fc/n + l/2 + l/2n) v{k) = \\S{k/n + l/2n) v(A:)||. 


Next, 

v{£) = uik-{n + l)/ 2 ) 

k 

— g-27ri(fc+(n+l)/2)^/n g2'n-ia(fc+(n+l)/2) 

k 

_ ^Tri^/n g'7ria(n+l) E ^-2'KiMln ^2-Kiak 

k 

We choose a = — (n + l)/2n. The factor outside the sum has absolute value 1 while 
the sum becomes u(f+(n + l)/ 2 ). Alternatively, 

\u{e)\ = \v{£-{n + l)/2)\. 
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Therefore 

\\C(^/n)u{^)\\ = \\C{e/n +1/2+ l/2n)v[e)\\ = \\S{e/n + l/2n)v[£)\\. 

The map U : u ^ v is unitary and we have shown 

Lemma 6. For n odd we have, 

{n{I + UMnU*)v, v) = \\S{£/n + l/2n) + n\\S{k/n + l/2n) v{k)\\'^. 

Remark. . When n is even we replace the shift {n + l)/2 by n/2 and a by —1/2, 
and the extra l/2n’s do not appear in the arguments of the S's. The quadratic 
form becomes Q{v) exactly, so UnMnU* = — 

Proposition 5. (n(/ + UMnU*) v, w) = (1 + Q{v) + 0{n~^/'^\\v\\‘^). 

Proof. Since dS/d^ is bounded, 

||S'(fc/n + l/2n) r(fc)|| < ||S'(fc/n) w(A:)|| + 0(n“^||u(fc)||). 

It follows from the arithmetic-geometric mean inequality that for any £ > 0 
\a + b\^ < (l+e)|a|2 + (l + e-i)|6 |2. 

We will take e —)■ 0 as n —)■ oo, so we obtain 

\\S{k/n + l/2n) v{k)f < (1 + e) \\S{k/n) v{k)f + 0{e-\-^\\v{k)f). 
Similarly, 

\\S{£/n + l/2n) rf < {1 + e) \\S{£/n) z;f + 

= (1 + £) \\S{£/n)v\\'^ + ), 

where we used ||u|| = ||i;||. 

Thus, 

(n(/ -I- UMnU*) V, w) < (1 -I- e) Q{v) + ||?;|p). 

Similarly, 

{n{I + UMnU*)v, t) > (1 -I- £)~^ Q{v) - 0(£“^n“^||w|p). 

We set £ = and put the inequalities together to get the statement of the 

proposition. □ 

Recall that Q{v) = (n(J — M„) v,v). If in the statement of the proposition we 
take the minimum of both sides over all v with ||i;|| = 1 we deduce that 

n + nXn = (1 + 0(n“^/^))(n — nXi) + 

where A„ is the bottom eigenvalue of and Ai the top eigenvalue. Since Ai = 

1 — fJ,i/n + o(l/n), we have n — nAi = + o(l), and then n -I- nA„ = fii + o(l), 
and then A„ = —1 -f fii/n -|- o(l). 

Using the minimax characterization of the eigenvalues we show similarly that 
Xn-k+i = —l+fJ-k/n + o{l) for each k. 


^This is easy to see directly. 
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5. A Stochastic Argument 


This section gives a bound on the largest eigenvalue of the matrix using a 
probabilistic argument. By inspection, 

m; = 

is a sub-stochastic matrix (with non-negative entries and row sums at most 1). Take 
M" as in ([^, an (n -|- 1) x (n -I- 1) stochastic matrix corresponding to a Markov 
chain absorbing at 0. The first (Dirichlet) eigenvector has first entry 0 and its 
corresponding eigenvalue /3* is the top eigenvalue of Thus 


/3 = 


(/^* - 5 ) 

(2/3) 


3,.* 1 

= -B - 

2 ^ 2 


is the top eigenvector of 

We will work in continuous time, thus, for any transition matrix M, 


Mt 


= E 

1=0 




r- 


The matrix L = I — M, the opposite of the generator of the semigroup {Mt}t>o, 
has row sums zero, and non-positive off diagonal entries]^ If v is a right eigenvector 
of M with eigenvalue /3, then v is an eigenvector of L with eigenvalue 1 — /?. A lower 
bound for the non trivial eigenvalues of L gives an upper bound for the eigenvalues 
of M. Throughout, we specialize to L = I — M„, let A* be the lowest non-zero 
eigenvalue of L, and (3 the highest eigenvector of Mn- 

Standard theory for absorbing Markov chains with all non-absorbing states con¬ 
nected shows that if r is the time to first absorption, for any non absorbing state 
as t tends to infinity. 


lim 

>-oo 


-logPg(r > t) _ 


Thus an upper bound on /? will follow from an upper bound on (t > t). 
Here is an outline of the proof. Begin by coupling the absorbing chain of interest 
with a simple random walk on = Z/(nZ). For a fixed b, let be the first 
time that the simple random walk travels ±& from its start. We derive the bound 
P^{t > Tb) < Gb, where Gf, < 1 is a particular constant described below. Define a 
sequence of stopping times as follows, t/ = Tb, is the first time following r/ 
that the walk travels ±5, similarly define r™. By the strong law of large numbers, 
T^/m —>■ = A(t/) almost surely. Thus 


P{t > rrifib} = P{t > t™}. 


Using the Markov property, P{t > r™} < G™. This implies there are positive Ci, 
C 2 with 

P{t > Cim^ib} < G™ + 


^Since some of our readers (indeed some of our authors) may not be probabilists we insert the 
following note; given any matrix L{x,y) with row sums zero and non-positive off diagonal entries 
one may construct a continuous time Markov process W = {Wt)t>o follows. Suppose Wq = wq 
is fixed. The process stays at wq for an exponential time ctq with mean l/\L{wo,wo)\- (Thus 
P{cro > = Q-tL{wo,wo) Then, choose wi 7 ^ wq with probability \L{wq,wi)\/L{wq,wq). 

Stay at wi for an exponential time cri (with mean 1/L{wi,wi)). Continue, choosing from 
L(iui, .)/L{wi,wi). 
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In our problem, classical random walk estimates show fii, ~ b^. We show, for 
b = y/n, is bounded away from one. Thus 


P{r > ciTO/Xb} < 2max(G'™, e 

and 

logPjr > ciirifib} ^ c' c" 

~ Hb n 

for some c', c" < 0. Backtracking gives the claimed bound in Theorem 

The argument is fairly robust—it works for a variety of diagonal entries. At the 
end of the proof, some additions are suggested which should give the right constant 
multiplying 

We begin by constructing two processes. For as long as possible, general absorp¬ 
tion rates will be used. Let X = {xt)t>o be the standard continuous time random 
walk on Z with jump rates 1 between neighbors. Take Xq = 0. Fix b G Z_|_ and let 
Tb be the first hitting time of {—b — l,b + 1}: 


(16) = inf{t > 0 : |xt I = & -I- 1} 

Let {ux}xez be killing rates, e.g. arbitrary non-negative real numbers. Add a 
cemetery state oo to Z. An absorbed process x = (xt)t>o, behaving as x until it is 
absorbed at oo with the rates {ux}x^z can be constructed as follows: Let £ be an 
independent exponential random variable with mean 1. Define an absorption time 
T e [0, oo] by 

T = inf < / fix ds > £ > . 

‘>0 [Jo J 

As soon as {ux^x^ does not vanish identically, t is characterized by 


(17) 


ds = £. 


More simply, xt = 


for 0 < t < oo. 


Xt if t < r 
00 otherwise 

The two processes are dehned on the same probability space as are r and Tb. 
The first goal is to estimate the chance that t > Tb in terms of the given rates. Our 
bounds are crude but suffice for Theorem [S) 


Proposition 6. With notation as above, for any 6 > 1, 

1 \ 

1 -I- (6 -I- 1) (6 -I- 1 — k)vk J 

with Vk = min{u-k,Uk). 

Note that the bound is achievable; if all Vy = 0 then both sides equal 1. 

Proof. For any fc S Z, < Ufc. Thus if r is the stopping time defined in 0 with 
Uk replaced by Vk, t >t. Therefore it is sufficient to bound P{t > Tb} from above. 
Now, everything is symmetric about zero. Consider the process Y = (Yt)t>o = 
(|xt|)t>o. This is Markov with jump rates: 

(2 iiy = 0,y' = l 

if 2/= Z+ and |y'- ?/| = 1 
I 0 otherwise. 


(18) P{t > Tb} < 


1-b (&-b l)2uo/2 


n 
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Clearly Tf, = inf(>o{yt =6+1}. Define the family of local times associated to Y: 
Ly{t) = f Sy{Ys)ds for y G Z+, t > 0, 

Jo 

where Sy is the indicator function of y. For any t > 0, 

rt 


'^Xgds 


0<y<b 


This gives 

{r > Tfc} = { ^yLyin) < £}■ 

0<y<b 

Taking expectations of both sides with respect to £ 


(19) 

( 20 ) 


> Thl = FI exp I - ^ VyLy{Tb) 

0<y<b 


< Yl E {exp {-{b + l)VyLy{Tb))}>’+^ . 

y=i 

The last bound follows from Holder’s inequality (with 6+1 functions). 

It is well known (see |2S] or Claim 2.4 of [2S] for the discrete time version) that 
for any y, 1 < y < 6 , Ly{Tb) is distributed as an exponential random variable with 
mean ( 6 +1 — y) and Lo{Tb) is exponential with mean (The process leaves zero 
twice as fast as it leaves other points.) Thus, for 1 < y < 6 , 

(6 + 1 - y)"^ 


E{exp{-{b + l)vyLy{Tb)} = 


(6 + 1 - y)“i + (6 + l)i 


F;{exp (-(6 + l)voLo{Tb)} = 
This completes the proof of Proposition 


l + ( 6 +l) 2 uo/ 2 ’ 


□ 


The bound of Propositionj^suggests introducing functions Eb, Gb on . Given 


n 


by 


(21) 

Eb{v) 

(22) 

Gb(v) 


1 + (6 + l)2uo/2 1 + (^ + 1)(^ + 1 “ *)^J 


1 


(b+i) 


, fe =0 


1 + (6 + 1)(6 +1 — fc ) ufc /2 


They have the following crucial monotonicity properties: say that v,v' G 
satisfies v < v' if this is true coordinate-wise. For v G let v be the non¬ 

decreasing rearrangement of v. Then 


(23) 

Ff,(v) < Gb{v) 

(24) 

V < v' ^ G'b(v) > Gh(v') 

(25) 

Gb(v) < Gbiv) 
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Return now to the process underlying Theorem (still keeping the extinction 
rates general.) Let z = {zt)t>o be defined on Z/nZ; it jumps to nearest neighbors at 
rate 1 and is killed with rates u = {u^)^^z/nZ- Suppose Zo = Let Vp, 0 < p < n— 1 
denote the non-decreasing rearrangement of u. Let t be the absorption time of z. 
Fix 6, 0<6<n/2 — 1 and let 


n = inf{Zr G {^-^-1,^ + ^+ Ill- 

Proposition in conjunction with properties (23), (24), and (25), imply that for 
any u, with G&(u) depending only on the first b coordinates of u. 


(26) 


> n] < Gb{u). 


Note that the upper bound is independent of 

Introduce a sequence of further stopping times: and if has been 

constructed. 


(27) ^m+i = inf{t > ■ zt e -b-l,z^^+b+ 1}} 

Informally speaking these stopping times end up being good. Because they cannot 
be larger than r, as in the previous treatment of a random walk on Z/nZ coinciding 
with zt up to the absorption time, then are (almost surely) finite for all m and 
the strong law of large numbers gives: 

lim — ^ p,b = {b+ 1)^ 
m—>-cK rn 

where pb = = (&+1)^ from the Classical Gambler’s Ruin (see Chapter 14 of 

[ 16 ]). 

This suggests that, for m large, the quantities 

Pjc [t > mpb] and [r > 

should behave similarly. Of course, care must be taken because r and are not 
independent. To proceed, we use a large deviations bound for 


Proposition 7. For defined in {27), there are positive constants, Ci, C 2 , inde¬ 
pendent of b and n such that for all m > 1, 


P[£,ni > Cimpb] < e 


Proof. Observe first that this is simply a large deviations bound for the first hitting 
time of the simple random walk (@ SO that n does not enter. The law of is well 
known (^see |22j , |12j , and ini)- It can be represented as a sum of 6 -I-1 independent 
exponential variables with means Oi, 02 ,..., af,+i given by: 


Oi. ^ = 2 I 1 — cos 


7r(2fc — 1) 

2 ( 6 + 1 ) 


Thus for 6 S (0, ai) 


b+l 


=n fh,- 


k=l 


Ok - 0 


By simple calculus, there is c > 0 such that for all a G (0, i], — log(l — a) < ca. 
Thus, for 9 € (0,ai/2], 
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Taking 9 = ^, 

Note that a„ is of order (njh)'^ so the right side of the last inequality is bounded 
uniformly in 6, say by A: > 1. Now, is a sum of m i.i.d. random variables so for 
any ci > 0 

> CiUlfib] < < g-m(ciaiA.,)/2-logfc^ 

Since fib = (b + 1)^, aifib can be bounded below by a constant e > 0, uniformly in 
& G N. Thus if Cl = 4 log k/e, the claimed bound holds with C 2 = log(A:). □ 


We can now set up a bound for the top eigenvalue. Working on Z/nZ but still 
with general absorption rates: 

(28) P^[t > cimfib] = P^[t > cimfib;im < cimfib] + > cimfid.^m > civnfib] 

(29) < P^T > ^rn] + P^l^m > CilTlflb] 

(30) 


It follows that 

A* = - lim —^— log(P 4 [r > Cimr,,]) > min{c 2 , - log(G6(a:))}. 

m^oo CimTb CiTb 

Since fib = (5+1)^, proving that with b of order — log(Gt,(a;)) is bounded below 
by a positive constant, uniformly in n, will complete the proof. 

Up to now, the kill rates u have been general. Specialize now to the rates for 
the matrix M" with any scrambling of its diagonal. The vector v is given by the 
& + 1 entries of: 


27r\\ 1 


0 , - 1 - cos — , - 1 - cos — , - 1 - cos — 


27r\\ 1 


n J J 3 \ V^/ 

1 / /47r\ 

- I 1 — cos I — 1 


47r\ 


n J 


1 / 

- 1 — cos 


'2-LlJ 


From the definition of Gh at (211 with b = [v^J: a Riemann sum approximation 
gives 

_^2 

lim G&(v) = . 

n—>-oo 

Indeed, 

-log(G6(v)) =^^^log ("l + (6 +!)(&+I-y) ("l-cos 

y—0 ^ \LJ/// 


b-1 




^2l4 


6 ^ 1 


y=o 

b-1 


y + 1 


y=o 

/■! 


1-^1 (y 

b) \b 


2 n^ b ( 

y=0 

/■i 

(1 - y)y^dy 


2rP 

2 


/O 


TT' 


2 
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Combining the pieces, we use /3 for the highest eigenvalue of and thus /3 = 
|/3* — Using this notation, we have shown that ^ < A* = 1 — (| + |/3) ■ Thus 
/3 < 1 ~ ^ ■ This completes the argument and ends the proof of Theorem 


Remarks. The above argument can be modified to handle quite general diagonal 
elements (in particular cos , 0 < j — 1 , needed for the application to the 

Heisenberg random walk). Indeed, for a = o(n), the argument goes through with 
no essential change with b = to show that with diagonal entries cos 
0 < j < n — 1 , the eigenvalue bound 1 — ^ holds (with c > 0 independent of n and 
a). 

The use of Holder’s inequality in (191 is crude. The joint distribution of the local 
times of birth and death processes is accessible (see [SSI). We hope this can be used 
to give sharp results for the constant. Finally we note that the approach to bound (3 
via an associated absorbing Markov chain was used in |^. There, a geometric path 
argument was used to complete the analysis. This gave cruder bounds (/3 < 1 —%) 

na 

but the argument worked for diagonal entries cos for any 1 < a < ^ as well 

as negative eigenvalues. 


6 . A RANDOM WALK ON THE AFFINE GROUP (MOD p) 

Let Ap be the affine group (mod p). Here, p is prime and elements of Ap can be 
represented as pairs (a, &), 1 < a < p — 1 , 0 < 6 < p — 1 

(oi, 6i)(a2,62) = {aia 2 , ai&2 + ^i)- 


All entries are taken mod p. Fix a generator g of the multiplicative group. Let 


5 = {(1,0),(1,1),(1,-1),(p,0),(p-\0)}. 


Set 

(31) 


Q{h) 


I A he S 
0 else. 


Convolution powers of Q converge to the uniform distribution U{h) = p(^p_i'^ ■ We 
use the representation theory of Ap and the analytic results of previous sections to 
show that order p^ steps are necessary and sufficient for convergence. 


Theorem 5. With definitions above, there are positive universal constants ci,C 2 , 
and C 3 such that for all primes p and k> 1 

cie <\\Q* - C/||rv < cae . 

Proof. By the usual Upper Bound Lemma (see | 8 ], Chapter 3): 

4||g*'‘-c/||Tv<5]dp||Q(p)'=f. 

Here, the sum is over nontrivial irreducible representations p of Ap, dp is the dimen¬ 
sion of p, Q{p) = Q{h)p{h) and the norm on the right is the trace norm. There 
are p —1 one dimensional irreducible representations indexed by a G { 1 , 2 ,... ,p— 1 }. 

(32) p„(a,6) = 
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where ct : Z* —?> Zp_i is the group morphism such that a{g) = 1. Then 


QiPa) — 


COS 


27ra 

p-1 


5 5 

There is one (p — 1) dimensional representation p. This may be realized on 

y = {/:{!,2,...,p-l}^C} 


with 

P{a,b)f{j) = f{aj), l<j<p-l. 

It is easy to check directly that p is a representation with character 

fO a ^ 1 

X{a,b) = S-1 a = l, 6^0. 

[p—1 a=l, 6 = 0 

A further simple check shows that (x|x) = J2a b lx(a) = 1 and that y is 

orthogonal to the characters pa in (32|. It follows that {pq}^T\,p is a full set of 
irreducible representations. Choose a basis i5go(-) for y, 0 < a < p — 2. Then, for 
Q in 

/ 1 1 \ 


QM = 5 



1 + 2 cos 




1 < j < p - 1 


Using any of the three techniques above, there is a constant c > 0 such that the 
largest and smallest eigenvalues of Q(p) (in absolute value) are bounded above by 
1 — -. Combining bounds 

Using cos(x) = 1 + ^ + 0{x^), the sum is at most c'^e for universal c^, c^. 
The final term is exponentially smaller proving the upper bound. The lower bound 
follows from the usual second moment method. (See [5] Chapter 3 Theorem 2 for 
details.) Further details are omitted. □ 


Remark. In this example, the matrix Q{p) is again the sum of a circulant and a 
diagonal matrix. Here, the circulant has eigenvalues | cos ; 0 < j < p — 2 and 

the diagonal matrix has entries f + § cos > 1 ^ J < P — 1- The Weyl bounds 
show that the largest and smallest eigenvalues are bounded in absolute value by 
1 — ^ for some fixed 6 > 0. Using this to bound the final term in the upper bound 

/ s 2 /c 

gives (p — 1)^ ( ^ ~ I? ) • This shows that the walk is close to random after order 

p^ log(p) steps. In the Heisenberg examples the Weyl bounds give a bound of 1 
which is useless. 
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The methods above can be applied to other walks on other groups. While we 
won’t carry out the details here, we briefly describe two further examples and point 
to our companion paper [3] for more. 

Example 2 (Borel Subgroup of S'L 2 (IP’p))- Let G be the 2x2 matrices of the form: 

(I € F*, h € Fp -f-)- (ct, 6). 

A minimal generating set (with the identity) is 

S = {id, (g, 0), 0), (1,1), (1, -1)}, g a generator of F*. 

The group has order p{p — 1) with p—1 1-dimensional representations and 4 repre¬ 
sentations of dimension {p — l)/2. They are explicitly described in p. 67. The 
Fourier analysis of the measure Q supported on S is almost the same as the analysis 
for the affine group. The results are that order p^ steps are necessary and sufficient 
for convergence to the uniform distribution. 

Example 3 {M{p^)). There are two nonabelian groups of order the Heisenberg 
group discussed above and M{p^). See [21] Chapter 4 Section 4. One description 
of the latter is: 

M{p^) = {(a, b) : a £ Zp, b G Z^}, (a, b){a', b') = {a + a' ,a*b' + b) 

with a*b= (l-f ap)b (modp^). This group has the same character table as Hi{p). 
It thus has p^ 1-dimensional representations and p—1 representations of dimension 
p. A minimal generating set (for odd p the identity is not needed to take care of 
parity problems) is 

5={(1,0)(-1,0)(0,1),(0,-1)}. 

The Fourier transforms of the associated Q at the p dimensional representations 
have the same form as the matrices in Q with diagonal elements 

/ 27rc \ 

where l<c<p—lis fixed (for the cth representation). We have not carried out 
the details, but, as shown in cni, it is known that order steps are necessary and 
sufficient for convergence. 


a b 
0 


7. Eigenvalues in the Bulk 


Consider the matrix M„(a) as in Q with 


cos 



0 < j < n — 1 


as the diagonal elements. The sections above give bounds on the largest and small¬ 
est eigenvalues. It is natural to give bounds for the empirical measure of all the 
eigenvalues. This is straightforward, using a theorem of Kac-Murdock-Szegb from 
pT] . We use the elegant form of Trotter [30]. If Ai > A 2 > • • ■ > A„ are the 
eigenvalues of M„(a), let 


An 


1 

n 
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Figure 1. The curve shows the eigenvalues predicted by / 2 , while 
the histogram gives the distribution of the actual eigenvalues of 
-^^io,ooo(= Afio,ooo(l))- Note that the curve has a very extreme, 
but finite slope around —1 and 1. For example, although it is 
clear that / 2 (— 1 ) = 0 , the small point on the left of the picture 
corresponds to (—.99,/ 2 (—.99)) « .32 


be the associated empirical measure. To describe the limit let 


(33) 



-1 < a; < 1 
else 


where F 2.1 is the hypergeometric function. Let ^2 be the associated measure. 
Distance between A„ and /X 2 is measured in the d 2 Wasserstein distance: 

d\{Kn,^Ji 2 ) = supi?|VF - Zp with W ~ A„, Z ^ 2 - 


Theorem 6. Let A„ be the empirical measure of the matrix M„(a) with 1 < a < 
n — 1. Let p ,2 be defined by {33}. Then, with a fixed, as n ^ 00 , 


d2(A„, P 2 ) 0. 


See Figure for an example. 


Remark. We have not seen a way to use this kind of asymptotics to bound the 
rate of convergence of a random walk. Indeed our limit theorem shows that the 
distribution of the bulk does not depend on a while previous results show the 
extreme eigenvalues crucially depend on a. 
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Proof. Trotter’s version of the Kac-Murdock-Szego theorem applies to Mn- If 


a{x, y) = cos{2'Kax) + cos(27rj/) 0 < x, y < 1, 

consider cr as a random variable on [0,1]^, endowed with the Lebesgue measure. 
This has distribution cos(27ra{7i) + cos(27r[/2) where Ui and U 2 are independent 
uniform on [0,1]. An elementary calculation shows that cos(27raC/) has an arc-sine 
density f{x) no matter what the integer a is. 


(34) 


f(.x) 


1 

ttVI — 

0 


-1 < X < 1 

else 


Trotter shows that the empirical measure is close to /r 2 , the distribution of a. 
It follows that the empirical measure of the eigenvalues has limiting distribution 
the law of {X + Y)/2 where X and Y are independent with density /(x). This 
convolution has density 


(35) /2(x) 



fr 


min{l,2a:+l} 


max{-1.2x-l} ^(( 2 ^_j,) 2 _i)(^ 2 _i) 


dy 


-1 < X < 1 
else 


The arguement below shows that this integral is in fact 



The integral in (34) is in fact a well known integral in a different guise. Let 
0 ^ fc ^ 1. Define 


K{k) = f 
Jo 


dt 


(37) ^^ov-7 /_ 

" ' lo x/(l-t2)(l-fc2t2) 

This is a complete elliptic integral and equals 


rFr 


1 1 


2.1 A, 


2 2 


;1,F 


(See Section 22.301 of [32].) For ease of notation, we will evaluate 

^min(l,a:+l) 


r 

Mx) = / 

J m; 


^(-i.x-i) \/(l - i^)(l -{x- tY) 
for |x| ^ 2 Making the variable change t > t -I- | it becomes 


.(-f-i.f-1) \fW) 


where 


h{t) = - P), 


1 X , , X 

a=l + -, b=l--. 


This is an even function of x so it is enough to consider when x ^ 0. Then we need 
to evaluate 

/■^ dt 


-b \/Ht) 
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Make the variable change t ^ bt and the integral becomes 

- = -K{k), k 

J-I V(l-t2)(a2-62t2) a 


b 

a 


The factor of 2 comes from the fact that we are integrating an even function from 
— 1 to 1, whereas in (37) the integral is from 0 to 1. Thus 


2 +(^ 2 ’ 2 ’^’ 

Sending x ^ 2x and multiplying by the appropriate constant, we have that the 
integral in (34) is in fact 



7r(l + |a;|) 


i 

1-1. 


U’ 

2’ 

Ai + ki; 


□ 


Remark. j^T] gives a similar expresson for the sum of two general beta variables. 
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